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ABSTRACT 

We present an estimate of the projected two-point correlation function (2PCF) of quasars in the 
Sloan Digital Sky Survey (SDSS) over the full range of one- and two-halo scales, 0.02 h~^ Mpc < r^ < 
120 h^ ^ Mpc. This was achieved by combining data from SDSS DR7 on large scales and lHennawi et aD 
(|2006l with appropriate statistical corrections) on small scales. Our combined clustering sample is the 
largest spectroscopic quasar clustering sample to date, containing ~ 48,000 quasars in the redshift 
range 0.4 '^ z < 2.5 with median redshift 1.4. We interpret these precise 2PCF measurements within 
the halo occupation distribution (HOD) framework and constrain the occupation functions of central 
and satellite quasars in dark matter halos. In order to explain the small-scale clustering, the HOD 
modeling requires that a small fraction of z ■^ 1.4 quasars, /sat = (7.4 ± 1.4) x 10~*, be satellites in 
dark matter halos. At z '--^ 1.4, the median masses of the host halos of central and satellite quasars 
are constrained to be M^cn = ^■'^tl'l x 10^^ /j-i y[^ a,nd M^at = 3.6t?o x 10^" h''^ Mq, respectively. 
To investigate the redshift evolution of the quasar-halo relationship, we also perform HOD modeling 
of the projected 2PCF measured by Shen et al. (2007 ) for SDSS quasars with median redshift 3.2. 
We find tentative evidence for an increase in the mass scale of quasar host halos — the inferred median 
mass of halos hosting central quasars at z '^ 3.2 is Mcen = 14.1^6 g ^ l^^^ h~^ M©. The cutoff profiles 
of the mean occupation functions of central quasars reveal that quasar luminosity is more tightly 
correlated with halo mass at higher redshifts. The average quasar duty cycle around the median host 
halo mass is inferred to be /q = 7.3t?:5 x 10"'' at z ~ 1.4 and /q = S.&tf,2' x 10"^ at z ~ 3.2. We 
discuss the implications of our results for quasar evolution and quasar-galaxy co-evolution. 



Subject headings: galaxies: nuclei 
matter 



1. INTRODUCTION 



quasars: general — large-scale structure of universe — dark 



Quasars are a highly luminous class of active galac- 
tic nuclei (AGN) beli eved to be powered by supermas- 
sivc black holes (e.g.. ISalDetedll964t iLvnden- Bell 196 
iKormendv fc Richstond 119951: 



Richstone et al. 19981). Wf 



observed bi-modality of galaxy color, as strong radia- 
tive feedback from quasars blows out gas from galax- 
ies a n d suppresses star form a tion ( e. g.. ISilk fc Ree^ 



Theoretical studies and observational evidence indi- 
cate that an epoch of quasar activity occurred during 
the formation of every r nassive elliptical gal axy (e.g., 
iSanders fc Mirabell [l99l iHopkins et all [20Q8h . Corre- 
lations betw een blac k hole mass an d host galaxy prop- 
erties (e.g.. iMaeorr ian et al. 19 981: iFterrarese fc Merritp 
2000t iGebh ardt et al. 2000 : iMerritt fc Ferrar esd i2001f 
Graham et al. 2002; TremaineeLalJ |2002t |King 200"" 



1998"; 'Kauffmann & Haehnclt' '2000a^ 'Volontcri et alj 
2003; Wvithc & Loeb 2003; Adclbcrgcr & Steidel 200a 
Di Matteo et al.ll2005HCroton et al.ll2006HHopkins et atl 



2006al) . 

Galaxies are known to reside in dark matter halos 
and, as s uch, probe structure formati on in the uni- 
verse fe.g.. [White fc Frenklll991l: Kauffm ann et al.lll993l: 
Navarro eta l."1995': 'Mo fc Whiteiri996HKauffmann et al.l 



suggest that black holes and galaxy spheroids evolve via 
a common physical process (e.g., iHopkins et al. 2008) . 
More generally, it has been sugg ested that there ex- 
ists a black hole fundamental plane (jHopkins et al.|[2007l) 
anal ogous to the fundainental plane for elliptical g alax- 
ies ( Dressier et~all 119871: [Diorgovski fc DavisI 119871 ). al- 
though recent results provide on ly weak evidence for 
its exi stence (jBeifiori et al.l l2012[ ). Combining these 
ideas, IHopkins et al.l (|2008l ) showed using a hydrody- 
namic simulation that in a major merger-driven sce- 
nario, galaxies evolve from an ultraluminous infrared 
galaxy to an elliptical galaxy through a quasar phase. 
In this scenario quasars play an important role in the 



1999t ISpringel et al. 2005b) . Since supermassive black 
holes are believed to reside a t the centers of mas- 
sive galaxies (e.g.. ISoltanlll982l) . a connection between 
black holes and their host dark matter halos is nat- 
urally expected. It has recently been suggested that 
this connection may be indirect, however, as supermas- 
sive black hole masses have been f ound to strongly cor- 
relate only with bulge properties (jKormendv fc Bended 
120111) . The black hole-halo relationship has been studied 
in analytic models and cosmological hydrodynami c sim- 
ulations (.Booth fc Schave 2 009, 2010; Di Matteo iTaD 
l2008at IVolonteri et al.l[201l! ). Obscrvationally, quasars 
provide a powerful tool for studying this relation. 

Because of their high luminosity, quasars are detected 
to z > 7 (e.g.. lMortlock et al.ll201l[ ). making them excel- 
lent probes of structure formation over cosmic time. The 



SDSS 



Richardson et al. 



TABLE 1 
)UASAR Clustering Samples 



Sample 


z 


2 


Flux Limit 


Area [deg^] 


A^QSO 


TTmax [h ^ Mpc] 


Tp [h-i Mpc] 


SDSS DR7 


0.4 < z < 2.5 


1.4 


i< 19.1 


6248 


47,699 


80.0 


1.0 <rp< 120 


Shen et al. (2007) 


2.9 < 2 < 5.4 


3.2 


i < 20.2 


4041 


4426 


100.0 


1.3 < rp < 211 


Hennawi et al. (2006) 


0.7 < 2 < 3.0 


1.6 


i < 19.1/21.0 




386 


~22 


0.02 <rp < 7.0 



Note. — i^or eacn sample, tne z column inaicates tne reasniit range, z is tnc meaian reasniit, jvqs 
is the upper limit of the line-of-sight pair separation for computing the projected 2PCF [see eq. JTJ], and the rp column is the (comoving) 
transverse pair separation range. 



spatial clustering of quasars can be used to probe the re- 
lationship between quasars and their host dark matter 
halos, providing constraints on the formation and evo- 
lution of quasars and their role in galaxy formation. In 
this paper, we present clustering measurements of a large 
spectroscopic sample of quasars (~ 48,000 quasars in 
the redshift range 0.4 5. z S, 2.5) froin the Sloan Digi- 
tal Sky Survey (SDSS: lYork et alil200(il ). spanning scales 
- 0.02 h-^ Mpc to -- 120 h-^ Mpc, and perform the- 
oretical modeling of the clustering to infer the relation 
between quasars and dark matter halos. 

Quasar clustering is typically measured t hrough th e 
two-point correlation function (2PCF; e.g., lArdll970l ). 
The advent o f the 2dF Q uasi-Stellar Object Redshift 
Survey (2QZ; iCroom et alJr2004 ') and the SDSS has en- 
abled high precision measurements of the 2PCF (e.g.. 



Pprciani et al. 2004; Groom et al. 2005; da Angela et al.l 
2005; Myers et al. 2006 , 2007 a. b; P o rciani fc Norberg 
[20071 [20081 [2001 Ida Angela et al 



2006; Sh en et al, 
200& ,Ross et al.l 
20121 ). It was found that quasars become increas- 



20091: (Hickox et aLll2011t IWhite et a" 



ingly biased relative to the und erlying dark mat- 
ter with increasing r edshift (e.g., ICroom et al.l 120011: 
Porciani e lTall 120041: ICroom etall 120051 : iMvers et all 



2007a; .Shen et al.l l2007l ). and that quas ar clustering; 
only w -eakly depends on lu minosit y ('e.g.. ICroorn et al] 
120051 [Poreiani fc Nor berg l2006t iMvers et al.l l2007al: 
Ida Angela eTaJ. 2008: Shen et al.l[2009t ). 

With an assumed cosmological model, the relative am- 
plitude of the large-scale quasar 2PCF and dark mat- 
ter 2PCF (i.e., the square of the quasar bias) can be 
inferred. By rela ting this bias factor to tha t of dark 
matter halos (e.g.. lJinidil998: She"tiret al.ll200H) . a rough 
estimate of the typical mass of a quasar-hosting dark 
matter halo can then be obtained. Observed quasar clus- 
tering shows that the typical halo mass is in the range 
of 10^^ - 10^^ h~^ MfT,, inde pendent of reds hift and lu- 
minosity (e.g.. |Porciani et al. 2004: Crooni et all 120051: 
Mvers et al.) I2006L l2007at iCoil et al.l l2007t iShen et al.l 



20071: Ida Angela et al.' '2008': 'Padma nabhan et al.l [200l 



Hickox et a l. 2009; White ct al. 2012). Theoretically, the 



characteristic mass of halos hosting quasars may be re- 
lated to the critical mass beyond w hich halos do not have 
large reservoirs of cold gas (e .g., iHopkins et al.l l2006al : 
ICrotonl[2009HKeres et al.ll2009t ). From the quasar abun- 
dance, the abundance of halos within the host halo mass 
range, and the halo formation time, the quasar lifetime 
(and duty cycle) can also be cons t rained (Cole fc Kaiseil 
1989t iMartini fc WeinbeT^ [200H [l aiman fc Huil 120011: 



Shankar et al.ll2009Ll201o3lbl . 120111) . 

Measurement of the small-scale quasar clustering is 



usually hindered by fiber collisions in fiber-based spec- 
troscopic surveys. On a single spectroscopic plate in the 
SDSS, the finite size of the fiber plugs prevents any two 
fibers from being placed within ^ 55" of each other, cor- 
responding to a comoving separatio n of ~ 1 ft.~^ Mpc at 
the typical quasar redshift of 1.5. IMvers et al.l ()2006f ) 
circumvented this problem by measuring the angular 
clustering of photometrica l ly cla ssified quasars on all 
scales, and iHennawi et al.l (|2006l hereafter IIE06) ob- 
tained the first measurement of the real-space quasar 
clustering on small scales by detecting close quasar bi- 
naries through follow-up spectroscopy of SDSS quasars. 
IIE06 detected excess clustering at small scales over ex- 
trapolations of the pow er law from l arge scales. This 
result was confirmed by IMvers et al.l (|2008[ ) , who found 
a slightly smaller excess than HE06 using a more ho- 
mogeneously selected sample. At small scales, quasar 
clustering probes the distribution of quasars within dark 
matter halos. It can advance our understanding of the 
triggering processes that drive quasar accretion (e.g., 
IHopkins et al.l [20081 ) . The distribution of quasars inside 
halos inferred from small-scale clustering can also provide 
insight into black hole mergers and bi n ary AGN (e.g. , 
Cpmerford ct al." '2009'; 'Liu et alj [20ll iFu et al.l [201l[ : 
Liu ct al. 2011; McGurk ct al. 20ll|). 

Accurate inferences of physical parameters, such as 
the host halo mass, satellite fraction, and quasar life- 
time, rely on precision clustering measurements. Spe- 
cific predictions from theory (e.g., re dshift and lu- 
minosity evolution of quasar hfetimes: ICole fc Kaiseij 
1989; Martini fc W einberdl200"llShenll2009HCrotonll2009r 
Bonoli et al.. .2009.) require precision measurements for 
informative tests. At present, large uncertainties in 
the host halo mass scale prevent firm establishm ent of 
the " no redshift evolution" scenario (e.g., Hopkins et al.l 
[2008t [Hickox et al1l2009| ). Clustering of X-ray AGN sug- 
gests that the host halos of X-ray AG N are more mas- 
sive than the h osts of quasars (e.g., iGilli et aD 120051 : 
ICoil et al]l2009l ). but the constrained mass range is too 
large for a definite co nclusion (for a recent review, see 
iCapDelluti etaiMoT^ . 

We now present precise quasar 2PCF measurements 
covering small to large scales and model the halo occu- 
pation distribution (HOD) to obtain accurate constraints 
on the host halo mass scale of quasars and its redshift 
evolution. We also obtain the quasar satellite fraction 
and the quasar duty cycle from our HOD analysis. The 
HOD provides a powerfu l theoretical framework (e.g., 
iBerlind fc Weinberg||2003 1 for understanding the cluster- 
ing properties of any biased tracer of mass. It has been 
used extensively for modeling and understa. nding galaxy 
clustering (e.g.. ISeliakI l200o[ IZheng et al.l [2005i i2007l 
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120091) and extended to study AGN an d quasar clustering 



[e^S, 



) and extenaed to study AUIN an d quasar clu stering 

iPorciani 61101120041: 1 Wake et al.i2008: Mivaii et alj 

iMltlShen et al.ll2010l: iStarikova et al.ll2011h . The work 
presented in this paper adopts, for the first time, a theo- 
retically motivated model of the AGN HOD to interpret 
the spatial clustering of quasars over a large range of 
scales. 

Our paper is organized as follows. In §2, we briefly 
describe our data sets. In §3, we present our estimate for 
the full projected 2PCF of SDSS quasars. The parame- 
terization of the quasar HOD and the theoretical model 
of the 2PCF are presented in §4. The HOD modeling 
of the 2PCF and results are presented in §5. Finally, 
we discuss our results in §6 and provide a summary in 
§7. Throughout the paper we assume a spatially flat, 
ACDM cosmology: fl,„ = 0.26, Qa = 0-74, Qh = P-0435 , 
n, = 0.96, as = 0.78, and h = 0.7 (jSpergel et al.ll2007D . 
We quote all distances in comoving h~^ Mpc and masses 
in units of h~^ Mp,. All m agnitudes are in the AB95 
system (jFukugita et al.lll996[ ). 

2. DATA 

The projected 2PCF measurements are drawn from 
three quasar clustering samples, a low-redshift sample 
(median redshift 1.4) from SDSS DR7, a high-redshift 
sample (median redshift 3.2) from SDSS DR5, and a bi- 
nary quasar sample (median redshift 1.6) for small-scale 
clustering. The samples are summarized in Table [T] and 
described below. 

2.1. SDSS Quasars 

The DR7 spe ct roscopic quasa r catalog 
(jSchneider et all 120101 : IShen et al.l 1201 It ) serves as 
the parent data set from which we build our clustering 
sample. This catalog contains 105,783 spectroscopically 
confirmed quasars with Mi < —22.0 over an area of 
9380 deg2 and a redshift range of 0.065 < z < 5.46. In 
order to construct a homogeneous clustering sample, 
we only consider quasars over the 6248 deg^ covered by 
the fi nal target selection algorithm fe.g.. iRichards et all 
l2002t ). Quasars at 2; < 3.0 are flux limited to i < 19.1, 
while higher-redshift quasars, which are fainter and 
rarer, are flux limited to i < 20.2. Additionally, we 
restrict our clustering sample to quasars in the redshift 
range 0.4 < z < 2.5. Our clustering sample then consists 
of 47,699 quasars flux limited to i < 19.1, with a median 
redshift of z = 1.4. 

At high redshifts, IShen et al.l (|2007l . hereafter SH07) 
have nieasured the projected 2P CF of quasars in SDSS 
DR5 (jAdelman-McCarthv et all l2007t ISchneider eFall 
|2007( ). The sample consists of 4426 spectroscopically 
identified quasars in the redshift interval 2.9 < z < 5.4, 
with a median redshift of z = 3.2. This sample is 
un iformly selected using the same targeting algorithm 
as IRichards et al.l (|2002[ ). S H07 calculate the proje cted 
2PCF using the estimator of lLandv fc Szalavl (|1993D and 
adopting a maximum line-of-sight separation of TTmax = 
100.0 h~^ Mpc. They measure the 2PCF over the trans- 
verse separation range 1.3 h~^ Mpc < r^ < 211 h~^ Mpc. 
We refer the reader to SH07 for a fuU discussion. 

2.2. Binary Quasars 

HE06 present measurements of the projected 2PCF of 
binarv auasars to seoarations as small as r„ '^ 20 h~^ 
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Fig. 1. — Projected 2PCF of the low-redshift quasar sample 
(median redshift 1.4). Triangles represent our measurements with 
the SDSS DR7 data. The original estimates from HE06 with the 
binary quasar sample are shown as light filled circles. Our cor- 
rection to the HE06 estimates by adopting TTmax = 80 h~^ Mpc, 
as used in our SDSS DR7 measurements, leads to the dark filled 
circles. The dashed vertical line indicates the fiber collision scale 
for the median redshift. 

kpc. The clustering sample spans the redshift range 
0.7 < z < 3.0 with a median redshift of z w 1.6. In 
studying binary quasars, HE06 include in their cluster- 
ing sample only same-redshift quasar pairs, defined as 
pairs with velocity differences of | Au |< 2000 km s~^. 
HE06 detect faint companions around a parent sample 
of 52,279 SDSS DR3 quasars and 6879 2QZ quasars with 
matching SDSS photometry. They also identify binary 
quasars from a photometric sample of 273,287 SDSS DR3 
quasar candidates, but we do not discuss them here be- 
cause these pairs were not included in the clustering sam- 
ple. In order to construct a sample of close quasar pairs 
large enough for a clustering study, HE06 employ four 
target selection algorithms, each with a different flux 
limit and varying degree of completeness, applied over 
different angular separation scales. We briefly discuss 
these algorithms below. We refer the reader to HE06 for 
a full discussion. 

For angular separations oi 6 < 3", pairs are identi- 
fied by fitting a multi-component point-spread function 
to atlas imag es of each quasar in a subset of 39,142 SDSS 
quasars fe.g.- lPindor et all 120031: llnada et al.ll2003D . For 
angular separations Z" < 9 < 60", HE06 employ a color 
similarity st atistic exploiting the q uasar color-redshift re- 
lation (e.g., IRichards et al.ll2001l ). In both algorithms, 
companion objects are fiux limited to i < 21.0 for the 
follow-up spectroscopy required to confirm the candidate 
quasar pair. At separations oi > 60", HE06 restrict 
their search to the 52,279 quasars in the SDSS spectro- 
scopic catalog, where an observation al completeness of ^ 
95% has been measured for i < 19.1 (jVanden Berk et aTJ 
l2005f ). No additional selection criteria are imposed be- 
yond those of the SDSS primary target selection algo- 
rithm. 

3. THE FULL PROJECTED TWO-POINT CORRELATION 
FUNCTION 

For the high-redshift sample, we adopt the measure- 
ments in SH07. For the low-redshift sample, however, 
we combine two sets of measurements to form the full 
oroiected 2PCF of auasars at z = 1.4. We measure the 
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projected 2PCF of quasars in SDSS DR7 on scales above 
1 h~^ Mpc. To accurately measure the projected 2PCF 
on scales below 1 h~^ Mpc, we make use of the binary 
quasar sample in HE06, with necessary corrections. 

To measure the large-scale 2PCF of SDSS DR7 
quasars, we follow the stand ard practice (e.g., 
IZehavi et alll2005t IShen et al]|2007[ ) to generate random 
catalogs using the angular selection function and redshift 
distribution of the DR7 uniform quasar sample. We com- 
pute the redshift-s pace 2PCF £a(?'pi tt) u sing the Landy- 
Szalay estimator tTandv &: Szalavlll9 93'). with rp and tt 
the transverse and line-of-sight pair separations, respec- 
tively. The projected 2 PCF Wp is obtained by integrating 
along the line of sight (iDavis fc Peebles! 1 1983f) , 



TABLE 2 
The Full Projected 2PCF of Low- 2; {z ■■ 



1.4) SDSS Quasars 



Wp{rp 



(s{rp, 7r)d7r, 



(1) 



where we adopt an upper limit of TTmax = 80 h~^ Mpc 
for the line-of-sight separation. To estimate errors on 
the 2PCF we use jackknife resampling as described in 
IShen et all (|2007[ ) with twenty-five jackknife samples. 
The measurements are shown as triangles with error bars 
in Figure[T] A more detailed analysis of quasar clustering 
based on SDSS DR7 quasars as functions of redshift and 
physical properties will be presented in a forthcoming 
paper (Shen et al., in preparation). 

The 2PCF measurements obtained from the DR7 sam- 
ple become unreliable at pair separations ^ 1 /i~^ Mpc 
because of fiber collisions. We overcome this limitation 
using data from HE06 for small scales, enabling us to 
measure the 2PCF down to transverse separations of 
^ 0.02 h^^ Mpc. We correct for the dominant selection 
difference between our samples as outlined below and in 
the Appendix. 

On average, quasars in the HE06 sample are one mag- 
nitude fainter and at slightly higher redshifts compared 
to those in our SDSS DR7 sample. There is also a dif- 
ference in the maximum line-of-sight separation, 7r,nax, 
used in calculating the projected 2PCF. In the original 
HE06 estimates, TTmax corresponds to ^ 22 h^^ Mpc, 
while our DR7 measurements adopt TTmax = 80 /i~^ Mpc. 
Figure [T] shows the combined effect of these sample se- 
lection differences on the projected 2PCF measurements. 
Over transverse separations 1-7 h~^ Mpc, the scales over 
which the two sets of measurements overlap, our SDSS 
DR7 measurements (triangles) exceed the HE06 mea- 
surements (light filled circles) by an average factor of 
2.3. The bulk of this discrepancy can be attributed to 
the lower value of TTmax used in HE06, which causes the 
offset to increase with rp. This effect is less significant on 
small scales {rp <C 1 h~^ Mpc) because the contribution 
to the projected 2PCF from pairs with large line-of-sight 
separations is small, allowing the clustering signal to be 
well recovered even with a smaller TTmax- 

In the Appendix, we discuss in detail our correction 
for the difference in TTmax- The corrected IIE06 projected 
2PCF measurements, indicated by the dark filled circles 
in Figure [1] are consistent with the SDSS DR7 data to 
within 1(7. HE06 only catalog projected quasar pairs 
(on which our correction relies) with angular separations 
< 90", so our correction cannot be applied to the data 
points above ^ 1.5 h^^ Mpc. Combining the corrected 
IIE06 measurements for rp < 1 h^^ Mpc with our SDSS 
DR7 measurements for r„ > 1.0 h^^ Mdc. we obtain an 



rp 


Wp 


«)p[HE06] 


5wp 


<5«)p[HE06] 


0.025 


14270 


(14020) 


9158 


(8928) 


0.037 


6864 


(6803) 


3892 


(3795) 


0.054 


3668 


(3688) 


2122 


(2068) 


0.080 


3588 


(1783) 


1894 


(1386) 


0.118 


1842 


(1128) 


1110 


(888.4) 


0.175 


1497 


(417.6) 


751.5 


(453.4) 


0.258 


221.6 


(235.0) 


242.2 


(211.6) 


0.381 


1026 


(636.2) 


323.8 


(251.0) 


0.564 


496.0 


(169.4) 


192.2 


(118.1) 


0.833 


839.8 


(248.6) 


126.8 


(71.08) 


1.334 


100.3 




29.16 




1.884 


62.82 




30.66 




2.661 


26.35 




19.41 




3.758 


47.19 




13.06 




5.309 


35.87 




8.942 




7.499 


31.83 




6.844 




10.593 


20.73 




6.668 




14.962 


10.00 




3.988 




21.135 


10.78 




3.216 




29.854 


5.283 




1.543 




42.170 


0.5126 




1.255 




59.566 


1.531 




0.8258 




84.140 


0.8502 




0.7377 




118.850 


0.6947 




0.4851 





Note. — The values of 



and Swp (error bar) are all 



in units of h ^ Mpc. The third and fifth columns are from the 
original measurements of HE06 below 1 h~^ Mpc (see the text). 

estimate of the full projected 2PCF of SDSS quasars, 
which is presented in Table 2. The uncertainties on each 
measurement are listed under Swp. For comparison, we 
also list the original measurements of IIE06 in parenthe- 
ses. We discuss the robustness of this small-scale 2PCF 
estimate in §6.1 and its possible luminosity evolution in 
§6.2. 

4. THEORETICAL FRAMEWORK FOR MODELING 
QUASAR CLUSTERING 

In this paper, we interpret the quasar 2PCF measure- 
ments within the HOD framework. The HOD provides a 
complete description of the relation between quasars and 
dark matter at the level of individual virialized halos. In 
the following sections, we introduce our quasar HOD pa- 
rameterization and describe the calculation of the 2PCF 
from the HOD. 

4.1. Halo Occupation Distribution of Quasars 

In analogy t o the galaxy HOD (e.g., 

iBerlind fc Weinberg] I2002D . the quasar HOD is de- 
fined by P{N\M), the conditional probability that a 
halo of virial mass M contains N quasars above some 
specified luminosity threshold, and by the relative 
spatial and velocity distributions of quasars within 
halos. In principle, P{N\M) could be fully specified by 
determining all its moments observationally from the 
quasar clustering at each order. In reality, however, 
quasar samples are too sparse to reliably measure 
hieher-order clusterine statistics. For our Duroose of 
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modeling the 2PCF, we only need the description of the 
first two moments, {N{M)) and {N{N - 1))m- Here 
we assume that the quasar HOD depends on halo mass 
alone, i.e., the quasar content of halos at a given mass is 
statistically independent of the large-s cale environments 
within which those ha los reside (e.g.. iBond et al.l 119911 : 
iLemson fc KauffmannI [1999) . W e neglect the assembly 
bias effect fe.g.. lGao et al.ll20Q5[ ). which should be small 
for the massive halos that typically host quasars. 

We represent the quasar mean occupation function as 
the sum of its physically illustrative central and satellite 
components, (N(M)) = (N^.^JM)) + (N,,,tiM)) (e.g. 



Kravtsov et ar]l2004 Izheng et al.l[2005l : IChatteriee et al.l 



2012|) . Our parameteriza t ion is largely based on the 
model of IChatteriee et all (|2012D . which results from a 
study of low-lumino sity AGN in a c osmological hydro- 
dynamic simulation ()Di Matteo et al, 2008b ). The mean 
occupation function is given as a softened step function 
for the central component plus a rolling-off power law for 
the satellite component, 



(NiM)) 



1 + erf 



logA/ — logA/ii 



M 
Ml 



exp 



i^logM 
Mcut 

M 



(2) 



The model admits five free parameters: A/n^in, the char- 
acteristic mass scale at which on average half of the ha- 
los host one quasar at the center of each halo; CTiogAf, 
the characteristic transition width of the softened step 
function; Mi, the approximate mass scale at which ha- 
los host, on average, one satellite quasar; a, the power 
law index of the mean satellite occupation function; and 
Mcut, the mass scale below which the satellite mean oc- 
cupation decays exponentially (see §6.3 for a discussion 
of alternative HOD models). We assume that the halo 
occupations of central and satellite quasars are indepen- 
dent. That is, for a given halo, the occupation of satellite 
quasars does not depend on whether there is a central 
quasar in the halo. This as s umpti on is motivated by the 
results in IChatteriee et al.l (|2012( ) , who find no correla- 
tion between the activities of central and satellite AGN 
in a hydrodynamic simulation. 

For a given halo mass, satellite galaxies and AGN in 
simulations are found to follow a Poisso i i distibution (e.g, 
Zheng et al.1 120051: iDegraf et al.ll20Tlbl: IChatteriee et alj 
2012( ). Hence we adopt a Poisson distribution for the 
satellite occupation number and a nearest integer distri- 
bution (Berlind & Weinberg 2002) for the central quasar 
occupation number. These are used to compute the sec- 
ond moment of the occupation number in the theoretical 
model of the 2PCF (see below). The above HOD param- 
eterization is the fiducial model used in this paper, but in 
§6.3 we also explore several alternate paramcterizations 
to test our results. 

For the spatial distribution of sa tellite quas ars within 
halos, we assume an NFW profile (iNavarro et al. 1997) 
with the concentration-mass relation (jZheng et al.ii2007i) 



c{M, z) 



Co 



l + z 
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where Af, is the nonlinear mass for collapse at z = 0, 
Co is the concentration parameter, and /? = —0.13. We 
adoDt cn = 25 for modeling z ~ 1.4 auasars. Assuming 



that the concentration increases with redshift, we adopt 
Co = 45 for modeling z ~ 3.2 quasars. These values are 
motivate d by the high conce ntration seen in observations 
of AGN ()Lin fc MoliiJl2007D . We have verified that our 
modeling only weakly depends on co for a wide range of 
values, from ^ 10 to ^ 60. We note that an NFW profile 
with high concentration is similar to a power law with 
index -3, which has been seen in the spatial distribution 
of black hole s and low-luininosity AGN in hydrody namic 
simulations (jDegraf et al.ll2011bl: IChatteriee et alj[2012.) . 

4.2. 2PCF Calculation 

The quasar 2PCF, ^g(r), gives the excess probability 
above random of finding a pair of quasars separated by a 
distance r. It can be represented in terms of the contribu- 
tions from intra-halo pairs, ^ihif), and inter-halo pairs, 
^2h{f)- The inter-halo or two-halo term is approximated 
as fe.g- lBerlind fc Weinbergl2002D 



^2k{r) 



dM-^{NiM))b^{M) 



1 2 



Uir), (4) 



where Uq is the quasar number density, dn/dM is the 
differential halo mass function, bh{M) is the halo bias 
factor, and Cmii^) is the 2PCF of matter. We identify 
the bracketed term as the quasar linear bias factor, bq. 
The intra-halo or one-halo term is expressed as 



l + ^ih{r) 



47rn^r^ 



dM— (N (N 
dM ^ ^ 
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where, for halos of mass Al, FM{r) is the average fraction 
of same-halo pairs at separations < r. The two-halo 
term depends only on {N{M)), while the one-halo term 
depends on {N (N — 1))^,/ and the radial profile of the 
spatial distribution of quasars through Fm (r) ■ 

In detail, the 2PCF calculation in our model is more 
complicated than the above expressions, as it includes 
the effects of halo exclusion, nonlinear clustering, and 
scale-dep endent halo b ias. It follows the method pro- 
posed injTinker et aU ()2005[ ). which improves the algo- 
rithm in [Zheng (2003) by incorporating a more accurate 
treatment of the halo exclusion effect. Halos in our cal- 
culation are defined as objects with mean density of 200 
times that of the background universe. The halo mass 
function is cq r npute d according to the formula given by 
iJenkins et al.l ()2001[ ). Fo r the large scale halo bias factor, 
we adopt the formula in lTinker et al.l (|2005l ). 

5. HOD MODELING AND ANALYSIS 

Using the routine developed in lZheng et al.l (j2007[ ). we 
perform a Markov Chain Monte Carlo (MCMC) model- 
ing of the projected 2PCF to sample the five-dimensional 
parameter space of the quasar HOD. We adopt flat pri- 
ors in logarithmic space for Mmin, Mi, and Afcut and in 
linear space for a and CTiog m (ciog m > 0) . We addition- 
ally require Mcut > 10^^ /i"^ Mg and 0.5 < a < 4.0, 
as motivated by theoretical studies and computational 
efficiency. We fit both the 2PCF and the abundance 
(number density) of quasars to constrain the HOD. The 
quasar number density and its associated u ncertainty are 
obtained from the luminosity function of iShen fc Kellvl 
(|2012l ). evaluated at the flux limit and median redshift 
of each samole. Since our clustering samoles are soarse. 
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we calculate x^ values using only the diagonal elements 
of the covariance matrix. Each MCMC chain contains 
100,000 points in the parameter space for modeling the 
low-redshift and high-redshift samples. 

5.1. Best-Fit HOD 

In Figure [5] we show the HOD of SDSS quasars at 
z ^ 1.4. Panel (a) shows the projected 2PCF measure- 
ments of our low-redshift clustering sample against the 
theoretical prediction of our best-fit HOD model (dot- 
ted line). We identify the best-fit model as the point 
in our five-dimensional parameter space associated with 
the global x^ minimum. If we rank the x^ of ^-U th^ 
models in ascending order, the first 68% of models give 
a range of predicted Wp indicated by the shaded en- 
velope in Figure [2l We note that Poisson errors are 
quoted on data points below the SDSS fiber collision 
scale. In applying our correction for TTmax, the Poisson 
errors quoted by HE06 are adjusted to reflect the addi- 
tional pair counts (see Table 2). Poisson counting errors 
assume that each quasar pair is statistically independent 
of all other pairs in the sample, which breaks down on 
large scales {rp > 1 h~^ Mpc) because of correlations 
between pairs in different radial bins. This leads to an 
underestimate of the uncertainty (e.g., SH07). Hence we 
excluded the single HE06 data point at rp > 0.8 h~^ Mpc 
from our modeling, which does not statistically alter the 
results. Our HOD model reproduces the clustering with 
a reduced x^ of 1.05. 

Panel (b) of Figure [2] shows the mean occupation func- 
tion from the best-fit HOD model, decomposed into its 
central (dashed line) and satellite (dot-dashed line) com- 
ponents. Similar to panel (a), the shaded regions indi- 
cate the range of the mean occupation function given 
by the 68% of models with the smallest x^- As seen in 
panel (b) , the satellite occupation becomes significant at 
mass scales above ~ lO^'' h~^ M©. This indicates that, 
at low redshift, typically only the most massive halos 
host multiple quasars. Panel (c) shows the full host halo 
mass distributions for central (dashed line) and satel- 
lite (dot-dashed line) quasars. It is derived by multiply- 
ing the mean occupation function of central (or satellite) 
quasars with the differential halo mass function. For a 
randomly chosen central (or satellite) quasar, the curve 
gives the probability distribution of its host halo mass. 
The curves are roughly log-normal for both central and 
satellite quasars. Finally, panel (d) shows the probabil- 
ity density function for the satellite fraction /sat from all 
models in the MCMC chain. The satellite fraction is de- 
fined as the ratio of the number density of satellites (inte- 
grated over all halo masses) to the total number density 
of quasars (including both central and satellite quasars). 
At the 68% confidence level, we find the satellite fraction 
of z - 1.4 SDSS quasars to be /^at = (7.4 ± 1.3) x lO""*. 

5.2. Redshift Evolution 

By performing HOD modeling of the 2PCFs for both 
the z = 1.4 and z — 3.2 quasar samples, we are able to 
study the evolution of the quasar-halo relation. 

Figure [3] shows the redshift evolution of the SDSS cen- 
tral quasar population. In panel (a), triangles represent 
the projected 2PCF measurements of SH07 at high red- 
shift and circles indicate our measurements from DR7 at 



low redshift. The best-fit theoretical predictions are rep- 
resented by the dotted (z = 1.4) and dashed (z — 3.2) 
lines. As before, the shaded regions are the envelopes 
from predictions of the 68% of models with the smallest 
values of x^- Out best-fit model reproduces the high- 
redshift clustering with a reduced x^ of 1.19. Panel (b) 
shows the best-fit mean occupation function of central 
quasars at z = 1.4 (dotted line) and z = 3.2 (dashed 
line). The mean occupation function steepens consid- 
erably with redshift over intermediate halo mass scales 
(- 10^3 _ 1014 /i-i Mq). The high-redshift occupation 
number exceeds the low-redshift one by a factor > 10 
above masses of ~ lO^'^'^ h~^ M©. Since the cutoff pro- 
file reflects the scatter in the relation between halo mass 
and quasar luminosity, the above steepening implies that 
quasar luminosity is more tightly correlated with halo 
mass at higher redshift. 

The mean occupation function shown in panel (b) of 
Figure [3] can also be interpreted as the mass-dependent 
duty cycle of quasars (i.e., the fraction of halos with 
an active central quasar). At each redshift, we esti- 
mate an average duty cycle /q for central quasars around 
the median host halo mass by averaging the mean cen- 
tral occupation number over the central 68% of the host 
halo mass distribution. From all the models in the 
MCMC chains, we infer that, at the 68% confidence 
level, /q = 7.3t?:5 x 10^" for z - 1.4 quasars and 
/q = 8.6^^2'' X 10"^ for z ~ 3.2 quasars. 

Panel (a) of Figure |4] shows the normalized host halo 
mass distributions for central quasars at z = 1.4 (solid 
line) and z = 3.2 (dashed line). At high redshift we find 
both a narrowing of the distribution and a shift to higher 
masses. Panel (b) shows the probability density func- 
tion for the median halo mass at each redshift. At the 
68% confidence level, denoted by the vertical lines under 
each curve, we find the characteristic host halo masses at 



= 1.4 and z = 3.2 to be Mr, 



i.l+y, X 10^2 /i-i M© 



and Afcon = 14-11:6:9 x 10^^ h^'^ M©, respectively. There 
is tentative evidence for a lower quasar host halo mass 
at lower redshift, or the "downsizing" of quasar activ- 
ity, but we note that this mass evolution is only sig- 
nificant at the ~ 1.5tT level. For satellite quasars, the 
characteristic median host halo mass at z = 1.4 is 
Afsat = 3.6t?:o X 10" h-^ M©. The characteristic host 
halo mass for z = 3.2 quasars is not well constrained. 
We note that a luminosity difference exists between the 
high- and low-redshift samples. However, we find that 
the luminosity dependence of the clustering is not able 
to produce such a large shift in halo mass (see §6.2). 

6. DISCUSSION 

We now discuss systematic issues and theoretical as- 
pects of our analysis. 

6.1. Robustness of the Small-Scale Correlation Function 

At Tp > 1 h~^ Mpc, quasars in both the HE06 sam- 
ple and the SDSS DR7 sample are uniformly selected by 
the same targeting algorithm, indicating our correction 
for TTniax should be complete. We have also verified that 
the redshift distribution of the corrected HE06 cluster- 
ing sample is not appreciably altered from the original 
redshift distribution of HE06 sources. The consistency 
between the corrected HE06 measurements and the DR7 
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Fig. 2. — Fit to the 2PCF of f = 1.4 quasars and the quasar HOD. Panel (a): our estimate of the full projected 2PCF of SDSS quasars 
(data points and error bars) against the prediction of our best-fit HOD model (dotted line). Panel (b): the mean occupation function 
of SDSS quasars, decomposed into its central (dashed line) and satellite (dot-dashed line) components. In both panel (a) and (b), the 
shaded envelopes indicate the 68% confidence intervals (see the text). Panel (c): the probability distribution of host halo masses, shown 
for both central (dashed line) and satellite (dot-dashed line) quasars. These distributions are obtained by multiplying the mean occupation 
functions with the differential halo mass function, averaged over all the models in the MCMC chain (see §5.1 for discussion). Panel (d): 
the probability density function of the satellite fraction as given by all our HOD models. In panels (c) and (d), the vertical dashed lines 
indicate the central 68% for each distribution. 



data then suggests that the effect on clustering from the 
remaining sample differences in redshift and himinosity 
does not exceed the \a leveL 

At Tp < 1 h^^ Mpc, we must examine the completeness 
of our correction for the larger TTmax, as the projected 
pair counts can only reflect the candidates identified 
by each selection algorithm for fohow-up spectroscopy. 
Roughly 80% of the sub-arcminute same-redshift quasar 
pairs were detected using the color-redshift relation (see 
HE06). Extending the line-of-sight separation limit from 
~ 22 /i~^ Mpc to ^ 80 /i^^Mpc corresponds to a red- 
shift difference of only Az ~ 0.04. For such a small 
redshift difference, the intrinsic dispersion and photo- 
metric redshift errors continue to dominate the scatter 
in the color-redshift relation, marginalizing the effect of 
our correction. This suggests that our correction with 
the SDSS DR7 TTmax is reasonably complete. 

A second selection effect involves differences in lumi- 
nosity threshold. The sub-arcminute clustering sam- 
ple contains companion quasars primarily targeted to a 
fainter flux limit (i < 21.0) than that of the SDSS par- 
ent sample (i < 19.1). Although the luminosity evo- 
lution of small-scale quasar clustering has not yet been 
investigated, studies of large-scale clustering {rp > 1 h"^ 



Mpc) have detected weak to no luminosity e volution over 
simila r redshift and lum inosity ranges (e.g.. [Mvers et ahl 
l2007at IShen et al.l[2009t ). Assuming that the small-scale 
clustering evolves with luminosity similarly to large-scale 
clustering, we estimate that the amplitude of the pro- 
jected 2PCF could be boosted, at most, by '^ 30% for 
a flux limit change from i < 21.0 to i < 19.1. This is 
within the statistical uncertainty of our modeling. 

Finally, we must ensure that including the small-scale 
clustering (i.e., the corrected HE06 estimates) does not 
alter the constraints given by the large-scale DR7 mea- 
surements. The small-scale clustering mainly constrains 
the satellite HOD, while the central quasar HOD is 
mainly constrained by the large-scale clustering and the 
number density. We have verified that our modeling 
yields an identical central occupation function regardless 
of whether the small-scale data are included. 

6.2. Redshift and Luminosity Evolution of the HOD 

In order to have the strongest statistical power, our 
clustering samples are constructed over a broad redshift 
and luminosity range to maximize the volume and the 
number of sources. The 2PCF obtained in this way can 
be interoreted as an averaee over the redshift and lu- 
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Fig. 3. — Redshift evolution of the central quasar HOD. Panel 
(a): projeeted 2PCF measurements and jaekknife errors of SH07 
(triangles) and those from SDSS DR7 (cireles), together with the 
best-fit model prediction (dotted for 2; ~ 1.4 and dashed for z ~ 
3.2). Panel (b); the best-fit mean occupation function of central 
quasars at z ~ 1.4 (dotted curve) and z ~ 3.2 (dashed curve). In 
both panels, the shaded region around each best-fit model shows 
the envelope determined from the 68% of models with the smallest 
X^ values. 



Fig. 4. — Redshift evolution of the host halo mass for central 
quasars. Panel (a): the full mass distribution of central quasar 
host halos, which is obtained by multiplying the central mean oc- 
cupation function with the differential halo mass function at each 
redshift (see § 5.1), averaged over all the models in the MCMC 
chain. Panel (b): probability density function of the median mass 
of halos hosting central quasars. The distribution is obtained by 
computing the median host halo mass for each model in the MCMC 
chain. In both panels, vertical lines denote the 68% confidence in- 
tervals. 



miiiosity intervals. However, interpreting the HOD as 
an average is problematic. Our HOD modeling uses 
halo properties (e.g., mass function, bias factor) and the 
quasar space density at the median redshift. Redshift 
evolution of the halo properties and quasar space den- 
sity can lead to systematic effects that exceed the sta- 
tistical uncertainties reflected in our modeling results. 
For a full interpretation of the measured 2PCF, all the 
evolution and selection effects should be properly incor- 
porated into the model, which would require additonal 
HOD parameters. Given the above effects, it can seem 
that the interpretation of our HOD modeling result is not 
straightforward. However, we find that our modeling re- 
sults can be meaningfully interpreted as representing the 
HOD for quasars at the median redshift of the sample 
(to within the quoted uncertainties), as explained and 
tested below. 

The true 2PCF for quasars at the median redshift is 
that measured over a narrow redshift range around the 
median redshift. If the 2PCF measured over a wider red- 
shift range around the median redshift (i.e., the average 
2PCF) is statistically consistent with the true 2PCF for 
the median redshift. it would be reasonable to use the 



measured average 2PCF to represent the true 2PCF at 
the median redshift. The HOD modeling of such a 2PCF 
measurement using the halo properties and quasar abun- 
dance at the median redshift would then lead to HOD 
constraints that can be regarded as those for quasars at 
the median redshift. 

To test whether our 2PCF measured over the full red- 
shift range can be regarded as that at the median red- 
shift, we divide the DR7 quasar sample into two subsam- 
ples above and below the median redshift and measure 
the projected 2PCF for each subsample. We find that the 
2PCFs of the two sub-samples are consistent with each 
other and with the 2PCF of the full sample (i.e., the one 
we model) . This indicates that the 2PCF measured from 
the full sample over the wide redshift range can indeed 
be regarded as the 2PCF of quasars at the median red- 
shift, supporting our modeling of the measurements at 
this single redshift. Since we are not in a position to 
evaluate this assumption for the HE06 or SH07 samples, 
we assume the clustering evolves weakly on small scales 
and at high redshift, similarly to the large-scale cluster- 
ing at low-redshift. This is not a strong assumption given 
the laree relative errors on the corrected HE06 and SHOT 
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measurements (typically > 50%) , which we expect to be 
sufficiently large to account for any biasing from redshift 
and/or luminosity evolution. 

From our modeling, we find that the median mass of 
central quasar host halos at z = 1.4 is a factor of ^ 3.5 
lower than that at z = 3.2. Although the SDSS targets 
quasars at 2; > 3.0 to a fainter flux limit (z < 20.2) than 
those at z < 3.0 (z < 19.1), the high-redshift quasars 
in SH07 are still typically a factor of ^ 2.8 more lu- 
minous than those in our low-redshift DR7 sample. To 
check whether the luminosity difference can account for 
the shift in host halo mass scales, we perform the fol- 
lowing te st. If we adopt the luminosity dependent 2PCF 
results in IShen et al.l (|2009[ ). we find that increasing the 
median luminosity of the low-redshift quasars to match 
that of the high-redshift quasars can, at most, boost the 
amplitude of the low-redshift 2PCF by 50%. HOD mod- 
eling of this boosted 2PCF indicates that the median 
mass of halos hosting central quasars is still a factor of 
2.7 lower than that of the high-redshift sample. There- 
fore, we conclude that the difference in quasar luminosity 
cannot account for the halo mass scale evolution. 

The decrease in this mass scale from z~3.2toz^l.4 
suggests some "downsizing" pattern in the evolution of 
the typical mass of quasar-hosting halos towards lower 
redshift, although the evidence is mild, significant only 
at the 1.5tT level. If the luminosity depende nce of clus- 
tering at high redshift is stronger than that in lShen et al.l 
()2009| ). the significance will be further reduced. Recently, 
some semi-analytic models and hydrodynamic simula- 
tions have shown that the luminosity evolution of quasar 
clustering is s tronger at higher redshifts (e.g., Shen 2009; 
ICrotonI [2009( 1 . For such a case, the luminosity differ- 
ence in our low- and high-redshift samples can result in 
a larger difference in host halo masses and the downsiz- 
ing pattern may largely reflect a selection bias. Future 
measurements with larger quasar samples can better con- 
strain the evolution of the quasar host halo mass. 

6.3. Alternate HOD Parameterizations 

Our parameterization of the quasar HOD is based on a 
physical mo del of AGN evolution fr om one hydrodynamic 
simulation (jChatteriee et al.1l2012[ ). However, given the 
simulation volume, the model was limited to the case of 
low-luminosity AGN. Applying a similar parameterized 
form to quasars is a large extrapolation, so it is necessary 
to investigate whether our results depend on the partic- 
ular HOD parameterization we have chosen and whether 
the HOD form is flexible enough to model the quasar 
2PCF. 

In our fiducial HOD model, the central mean occupa- 
tion function approaches unity at sufficiently large mass 
[see eq. ([U], although in the modeling results it does 
not reach unity over the range of physical halo masses. 
To check whether our modeling results are affected by 
this asymptotic feature in our parameterization, we re- 
peat our modeling and allow the softened step function to 
approach an arbitrary asymptotic value less than unity. 
With this additional degree of freedom, we find that the 
best-fit HOD is statistically identical to that of the orig- 
inal five-parameter model at both redshifts, indicating 
that our fiducial HOD parameterization has the flexibil- 
ity similar to the six-parameter model in modeling the 
auasar 2PCFs here. 



As a further test, we keep the above additional de- 
gree of freedom and modify the cutoff profile by chang- 
ing the 1 -(- erf term in the central occupation function to 
[1 -|- erf]^. Again, we are able to obtain good fits to the 
2PCF measurements. Compared to the fiducial model, 
this modified model leads to a different amplitude of the 
mean central quasar occupation function for halos above 
2^013.5 ^-1 j^^_ Pqj. 2 ^ ^j^ quasars, the amplitude from 
the best-fit model increases by a factor of ^^ 2, while for 
z ^ 3.2 quasars, it drops by a factor of ~ 15. For the 
z ~ 1.4 quasars, the mass scale for satellite quasars shifts 
by about 0.5 dex towards the high mass end. 

However, since massive halos are rare, the host halo 
mass distributions for central quasars remain statistically 
identical to those from the fiducial model (see Figure |4]) , 
indicating that our constraints on the typical host halo 
mass of central quasars are robust. The test suggests 
that the projected 2PCF has little constraining power 
for the central quasar HOD in high-mass halos. 

As an additional test of the lack of constraining power 
for the HOD in high mass halos, we introduce a sharp 
cutoff in the fiducial model {Nccn{M)) for halos of mass 
M > Merit and leave the form of {Nsa,t{M)) unchanged. 
The motivation of such a high-mass cutoff is related to 
the decline in star forma tion for central galaxies in halos 
of > 10" /i-i Mq (e.s.. iConrov fc WechsleH[2009h — ff 
both star formation and quasars are fueled by a common 
reservoir of cold gas, a decline in star formation would 
imply a decline in quasar activity in massive halos. For 
two tests with Mcnt^lO" h'^ Mq and 10^^^ h^^ Mq 
we find equally good fits to the 2PCF (as compared to 
our fiducial modeling). The inferred satellite fractions 
and median host halo mass scales for central quasars are 
consistent with the results from the fiducial model within 
the uncertainty. The tests clearly show that the 2PCF is 
not sensitive to the central quasar HOD in massive halos 
{M > 10" h~'^ Mq). It also demonstrates that our 
results in central quasar halo mass scale are not strongly 
affected by the assumed form of the high- mass end of the 
HOD. 

Unlike for central quasars, the satellite quasar HOD 
is affected more by the parameterization. The satel- 
lite fraction inferred from our fiducial model therefore 
has some systematic uncertainty, although those from 
the modified models remain consistent with that from 
the fiducial model at the la level. It is certain, how- 
ever, that at least a small fraction of quasars must be 
satellites to produce the small-scale clustering. Could 
all quasars be satellites? Although this satellite-only 
scenar io is not consistent w ith observations of luminous 
AGN (jStarikova et al.|[2011[ ). we perform a test by setting 
(A'cen(Af )) = and leaving the rolloff powe r-law form of 
(Nsat iM)) unchanged so that /sat = 1 fe.g.. lMivaii et al.l 
l2011t ). We find that the model is unable to simultane- 
ously reproduce both the large- and small-scale cluster- 
ing, ruling out the possibility that all quasars are satel- 
lites (at least for the rolloff power-law form of the HOD 
adopted here). 

Recently, af t er our initial submission of the paper, 
iKavo fc Oguril (|2012[ ) reported a satellite fraction for 
z ^ 1.4 qusasars ~ 100 times higher than our inference, 
also based on HOD modeling of the 2PCF. They pa- 
rameterize the mean occupation functions of central and 
satellite quasars to have the same shape and only differ in 
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normalization. Both follow a Gaussian curve {N(M)) in 
logarithmic space of halo mass with fixed width (~ 0.77 
dex in a full- width- half- maximum sense), and the relative 
normalization is given by the mass-independent satellite 
fraction f^at such that (iVce„(M)) = (1 - Ut){N{M)) 
and (7Vsat(M)) = /sat(A^(M)). The two free parameters, 
fsat and the halo mass scale are constrained by fitting 
the number density of quasars and the 2PCF. In such a 
parameterization, {Nccn{M)) is similar to our test case 
with a high-mass cutoff, but {Nsat{M)) is substantially 
different. 

Although there is no compelling reason to believe that 
(Nr.p.rtJM)) an d (Ni,f, t(M)) have the same shape, the 
iKavo fc Qguril ()2012D model, differing substantially in 
{Nsat{M)) from ours, can lead to good fits to the 2PCF. 
This indicates that there exists large degeneracy in the 
satellite HOD constrained from the 2PCF. Therefore, 
compared to the central quasar halo mass scale, the satel- 
lite HOD and satellite fraction are less robustly deter- 
mined. Additional observables are necessary to break the 
degeneracy. Since the satellite mass scales are different 
in our model and theirs, the distribution of the line-of- 
sight velocity differences of q uasar pairs can be us ed to 
differentiate the two models. iKavo fc Oguril (|2012| ) tab- 
ulate the relative line-of-sight velocities of quasar pairs 
in their sample. Although the current data are not pre- 
cise enough, we make an attempt anyway to compare the 
velocity distribution with model predictions. It appears 
that neither model is favored by the data. While our 
model produces too many pairs of large relative line-of- 
sight velocity (caused by putting too many satelltes in 
massive halos), theirs tends to underpredict such pairs. 
The true solution may lie in between the two models, and 
a larger sample of small-separation quasar pairs would 
help to definitively constrain the satellite HOD. In con- 
clusion, it is necessary to keep in mind the caveat of 
large degeneracy in interpreting the satellite HOD con- 
straints in our and their modeling results. These degen- 
eracies underscore the importance that future modeling 
employ physically-motivated parameterizations that al- 
low the clearest possible interpretations. 

Finally, we consider a parameterization in which the 
central quasar mean occupation function is modeled as a 
sharp step function with the amplitude as a free parame- 
ter. That is, this model assumes a constant duty cycle for 
central quasars. It also has five free parameters, but we 
find it is unable to reproduce the clustering on one-halo 
scales as a result of insufficient central-satellite pairs. As 
a caveat, we note that in our model we independently 
assign central and satellite quasars to halos (i.e., assume 
no correlation between the activities of the central and 
satellite black holes within a given halo; see §4.1). If 
major mergers trigger quasar activity, for example, there 
may be an enhanced probability of close quasar pairs 
in group-sized halos, suggesting satellite black holes are 
more likely to become active in a halo with an active cen- 
tral black hole. In such a case, preferentially assigning 
satellites to halos containing central quasars in principle 
may create sufficient central-satellite pairs to reproduce 
the one-halo clustering. However, we reserve a more de- 
tailed investigation for future work. Conservatively, we 
conclude that the constant duty cycle scenario for central 
quasars is excluded if the occupation statuses of central 
and satellite quasars are uncorrelated. 



Overall, we find that our fiducial parameterization is 
able to well reproduce the observed quasar clustering and 
our modeling results on the mass scales of central quasars 
do not seem to be an artificial effect from the parame- 
terization. 

7. CONCLUSIONS 

In this paper we present the first estimate of the pro- 
jected 2PCF of SDSS quasars a.t z ^ 1.4 over the full one- 
and two-halo scales. We perform HOD modeling of the 
projected 2PCF to obtain tight constraints on the rela- 
tion between central and satellite quasars and their host 
dark matter halos. We also model the projected 2PCF 
of SDSS quasars at high redshift (z ~ 3.2) measured by 
SH07 to investigate the redshift evolution of the HOD. 

The key results from our study include: 

(1) We measure the projected 2PCF of 2: '--^ 1.4 
SDSS quasars over a large range of scales, 
0.02 /i^i Mpc < rp < 120 h'^ Mpc, by combin- 
ing measurements from the SDSS DR7 data with 
those from the HE06 binary quasar sample (with 
appropriate corrections) . 

(2) Our quasar HOD model, with its parameterization 
motivated by AGN evolution simulation, is able to 
interpret the projected 2PCFs of SDSS quasars on 
all scales and at both z ^ 1.4 and z ^ 3.2. 

(3) For the z ~ 1.4 SDSS quasar sample, we model 
the projected 2PCF over all scales to obtain the 
tightest constraints to date on the median host halo 
mass scales, M^en = 4,-^-oa x 10^^ ^"^ ^G for 
central quasars and il/gat = 3.6+5;o x 10^** h~^ Mq 
for satellites. 



(4) A small fraction of z ~ 1.4 SDSS quasars need 
to be satellites within dark matter halos to re- 
produce the small-scale clustering. Under our pa- 
rameterization, we obtain the first estimate of the 
satellite fraction of 2; ~ 1.4 SDSS quasars, /sat — 
(7.4 ±1.4) X lO"-*. 

(5) The median host halo mass for z ^ 3.2 cen- 
tral quasars is inferred to be Mccn = 14.l!tgg x 
10^^ h~^ Mq, about 3.5 times higher than that for 
z ^ 1.4 central quasars (a 1.5cr result). 

(6) The cutoff profile of the mean occupation function 
of central quasars steepens considerably with red- 
shift. Since the cutoff profile refiects the scatter in 
the relation between halo mass and quasar luminos- 
ity, this steepening implies that quasar luminosity 
is more tightly correlated with halo mass at higher 
redshift. 

(7) The average duty cycles for central quasars resid- 
ing in halos around the median host halo mass are 
inferred to be /q == 7.3^51 x lO"'^ at z - 1.4 and 

/q 



8.6+?°/ X 10-2 



at z '^ 3.2. 



The median mass scales inferred from our HOD analy- 
sis are consistent with the curre nt paradigm of galaxy 
and quasar co-evolution (e.g., iHopkins et al.l 



ICrotonl I200I iHickox et all 20091). In this Daradigm. a 
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quasar phase and rapid growth of the stellar bulge of 
the galaxy occurs when the host halo reaches a critical 
mass between 10^^ — lO^^M©. Our results indicate a sim- 
ilar mass scale for both low- and high-redshift quasars, 
but more data is needed to precisely measure the red- 
shift evolution of the host mass scales. A more accurate 
measurement of the mass scale will provide the neces- 
sary input for future developments and tests of quasar 
evolution theory. 

Although the exact mechanism for triggering quasar 
activity is not fully know n (for a recent review, see 
[Alexander fc Hickoxl |2012[ ) , some of the leading candi- 
dates incl ude higher gas inflow rates frorn . gas-rich galaxy 
merge r s (iKauffmann fc Haehneltl |2000U ISpringel et al.l 
l2005at iHopkins et aL 2006b ), secular processes an d mi- 
nor mergers (e.g., Kormendv & Kennicutt '2004), and 
steady cold flows (Di Mattco et al. 2012 ). Better mea- 
surements of small-scale clustering at different redshifts 
may provide a useful way to understand triggering pro- 
cesses. For example, a purely merger-driven model 
will predict excess power at small scale s compared to 
a model with only s ecular p rocesses fe.g..|Hopkins et al.l 
l2008HThacker et al. 2009; D egraf et al.ll2011a^ . The halo 
mass scales inferred from HOD modeling can be used to 
predict the small-scale clustering of a population (e.g., 
galaxies) dominated by secular processes. With such 
a control sample, the relative small scale excess would 
provide important information on the role of mergers in 
triggering quasar activity. In a purely cold flow-driven 
model, quasar activity has a speciflc dependence on halo 
mass. Hence the mass scale constrained by modeling 
clustering measurements is also useful for testing such 
models. 

We obtained a low satellite fraction for z ~ 1.4 quasars, 
while that for z ^ 3.2 quasars is not well constrained 
given the data. The redshift evolution of the satellite 
fraction, if constrained, can have important implications 
for black hole mergers and the binary AGN population. 
We find that the 2PCF alone may not be able to pro- 
vide accurate and unbiased constraints on the satellite 
fraction of quasars, so other observables like the quasar 
pairwise velocity distribution would be useful for tight- 
ening the constraints. 

The quasar duty cycle inferred from our study strongly 
varies with redshift s imilarly to predictions from semi- 
analytic models (e.g.. lShenll2009[ ). A fair comparison be- 
tween the exact values of duty cycles in different works 
is not straightforward, given the subtle differences in 
the definition and in the averaging. Broadly speaking. 



while we obtain a lower duty cycle at z ^ 1.4, the 
du ty cycle at z 7 ^ 3.2 is consistent with the predictions 
of IShankar et al.l ()2010aD . who use a phenomenological 
model to describe the statistical properties of quasars. 
IChatteriee et al.l ()2012f ) find a similar redshift evolution 
in the active fraction of low-luminosity AGN. Using a 
hydrodynamic simulation they show that feedback from 
the black hole suppresses its own growth, lowering the 
number density of AGN at low redshifts for co mpara- 
ble luminosities. Recently, IShankar et al.l (|2010b[ ) inves- 
tigated the degeneracy in theoretical interpretation of 
the duty cycle with the quasar luminosity-host halo rela- 
tion. Putting aside the subtlety in the exact definition, 
our low- redshi f t value of the duty cycle is consistent with 
IShankar et al.l ()2010b[ ). In this model, a higher value of 
duty cycle corresponds to a lower scatter in the quasar 
luminosity-halo mass relation. Our results therefore sup- 
port the picture of a lower scatter between halo mass and 
quasar luminosity at higher redshift. 

Understanding the impact of AGN on cosmology and 
galaxy evolution is one of the key science drivers be- 
hind large-scale numerical simulations and future multi- 
wavelength surveys (e.g., BigBOSS, eROSITA, LSST, 
SDSS-IH/BOSS). The HOD framework, capable of ex- 
ploiting the statistical power of clustering in these sim- 
ulations and survey data sets, is expected to provide in- 
creasingly informative tests of AGN evolution. 
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APPENDIX 

CORRECTING THE SMALL-SCALE CORRELATION FUNCTION 

Because of the low pair counts at small separations, HE06 calculate their clustering measurements in terms of a 
dimensionless, volume- and redshift-averaged quantity, W ^. They use the estimator 



1 -I- M^p(i?min,^n 



kQQ) 



(Al) 



where (QQ) is the data-data pair count assigned to the transverse radial bin [-Rmim ^max] and (Q-R) is the data-random 
pair count assigned to the same bin. The method of assigning (QQ) to each radial bin varies with targeting algorithm 
and lOJi) depends on a selection function defined to account for sources of incompleteness. The proiected 2PCF. w„. 
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is recovered by integrating Wp along the line of sight, 

WpiRmin.RmnKjdTr 

a(z)H[z) 

where we note Vmiix/[0'{z)H{z)] ^ TTmax- HE06 set Wmax, the maximum velocity difference for a pair, to 2000 km s~ , 
which gives a rescaling factor of 27rniax ^ 44 h~^ Mpc. 

Correcting the Data-Data Pair Count 

To match the TTmax used in the 2PCF measurement of the DR7 quasars, we correct the projected 2PCF measurements 
of HE06 by including the (previously excluded) projected quasar pairs with hne-of-sight separations < 80.0 h^^ Mpc, 
the upper line-of-sight separation limit of our DR7 measurements. HE06 publish a catalog of all their excluded 
projected pairs with a redshift difference of Az < 0.5. Since 7r,„ax = 80 h~^Mpc corresponds to Az ^ 0.04, this 
sample is complete for our purposes. In applying the correction, we first select the projected pairs from these catalogs 
satisfying all the selection criteria of the particular targeting algorithm which identified them, excluding the velocity 
difference requirement | Av \< 2000 km s^^. We then calculate the line-of-sight separation for each selected pair of 
quasars from their spectroscopic redshifts. Each projected pair with n < 80.0 h~^ Mpc is added to the appropriate 
transverse radial bin. 

However, a subtlety arises in assigning the additional pair counts. For sub-arcminute pairs, the companion quasar 
is typically targeted to a fainter magnitude than its SDSS parent. Since auto-clustering measurements require a 
uniformly fiux-limitcd sample, the usual interpretation of (QQ) and (QR) as pair counts cannot be applied. Instead, 
HE06 define (QQ) and (QR) in terms of the number of companions about quasars in the parent samples. As we will 
show below, (QR) is then d efined as dependent on the quasar luminosity function ('e.g.. lBovle et al.|[2000HCroom et al.l 
120041 : [Richards et al.l [2005[ ) to account for flux differences between the companion samples. For most quasar pairs 
selected at subarcminute separations, the parent and companion samples are distinct and the additoinal pair count is 
just the number of projected pairs we add to the bin. However, for overlapping pairs discovered in the parent SDSS 
+ 2QZ catalog, the parent and companion samples are identical. In this case the additional pair count is twice the 
number of projected pairs we add to the bin, since each of the two objects in the pair has a companion in the parent 
sample. We refer the reader to HE06 for a full description of this method. 

Correcting the Data-Random Pair Count 

We now turn our attention to the data-random pair count, (QR). Since companion quasars at < 60" are targeted 
to magnitudes fainter than the SDSS flux limit, random mock catalogs cannot be used to estimate (QR) for each radial 
bin. Instead, HE06 separately compute the data-random contribution {QR)k for each selection algorithm k in terms 
of a selection function accounting for sources of incompleteness, 

{QR)k = Yl "(^J' * < ^'k)VshciiS{zj, 9j), (A3) 

where n{zj, i < i'^,) is the number density of quasars above the flux limit i'f, of selection algorithm k, T^hoii is the 
cylindrical volume probed by the radial bin in redshift space, S{zj, 6j) is the selection probability function, and 
Nqso. k is the total number of quasars in the parent sample of selection algorithm k. For each radial bin, the total 
(QR) is then obtained by summing over all contributions. In this calculation, only the cylindrical redshift-space volume 
probed by each radial bin, 14hcii = 27r(i?^ax ~ ^min)'''max, significantly depends on our choice of TTmax- Since, for each 
radial bin, (QR) oc V^hoU, our correction factor is given simply by the ratio of TTmax values used in computing the 
projected 2PCF for our SDSS DR7 sample and the HE06 sample (80 h'^ Mpc/22 h'^ Mpc w 3.6). 

Recalculating the Small-Scale Correlation Function 

With our corrected pair counts, we recalculate the small-scale projected 2PCF. First, we calculate the dimensionless 
clustering statistic Wp using the estimator of HE06 [see eq. (jAll) ]. Then, we recover Wp by integrating along the line 
of sight to TTniax = 80 h~^ Mpc, the line-of-sight separation limit for our SDSS DR7 sample. Our complete correction 
to the projected 2PCF measurements of HE06 is given as 

( {QQ)c 
\{QR)c 

where {QQ)c is the corrected data-data pair count for each radial bin as described above and {QR)c — 
(7''max/7''max,o)(Q-R) is the corrcctcd data-random pair count, with TTmax = 80 h~^ Mpc and TTmax, o = 22 h^^ Mpc. Fi- 
nally, we re- estimate t he Po isson errors using the corrected data-data counts. Following HE06, we use the uncertainties 
tabulated in lGehrelsl ()1986D for bins with {QQ)c < 30. 
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